Analysis of Multi-Document Viewpoint Summarization Using Multi-Dimensional Genres

نویسندگان

  • Yohei Seki
  • Koji Eguchi
  • Noriko Kando
چکیده

An interactive information retrieval system that provides different types of summaries of retrieved documents according to each user’s information needs can be effective for understanding the contents. The purpose of this study is to build a multi-document summarizer to produce summaries according to such viewpoints. As an exploratory stage of investigation, we examined the effectiveness of genre for source documents to produce different types of summaries. Once a set of documents on a topic is provided to our summarization system, a list of topics discussed in the given document set is presented, so that the user can select a topic of interest from the list as well as the summary type, such as opinion-oriented, fact-reporting or knowledge-focused, according to their requirements. We assume a relationship between a summary type and human recognition of information types included in the source: a document genre. We also analyzed the results of the multidocument summarization using automatic genre classification to reveal the association between genre dimensions and the summary types.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Next Step for Multi-Document Summarization: A Heterogeneous Multi-Genre Corpus Built with a Novel Construction Approach

Research in multi-document summarization has focused on newswire corpora since the early beginnings. However, the newswire genre provides genre-specific features such as sentence position which are easy to exploit in summarization systems. Such easy to exploit genre-specific features are available in other genres as well. We therefore present the new hMDS corpus for multi-document summarization...

متن کامل

Multi-Document Summarization By Sentence Extraction

This paper discusses a text extraction approach to multidocument summarization that builds on single-document summarization methods by using additional, available in-, formation about the document set as a whole and the relationships between the documents. Multi-document summarization differs from single in that the issues of compression, speed, redundancy and passage selection are critical in ...

متن کامل

Workshop on Automatic Summarization for Different Genres, Media and Languages

ive Summarization of Line Graphs from Popular Media Charles Greenbacker, Peng Wu, Sandra Carberry, Kathleen McCoy and Stephanie Elzer . . . . . . 41 Extractive Multi-Document Summaries Should Explicitly Not Contain Document Specific Content Rebecca Mason and Eugene Charniak . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

متن کامل

Multi-topic Based Query-Oriented Summarization

Query-oriented summarization aims at extracting an informative summary from a document collection for a given query. It is very useful to help users grasp the main information related to a query. Existing work can be mainly classified into two categories: supervised method and unsupervised method. The former requires training examples, which makes the method limited to predefined domains. While...

متن کامل

A Generative Approach for Multi-Document Summarization using the Noisy Channel Model

Multi-document summarization is the automatic production of a unique summary from a collection of texts. This task has become very important, since it assists the information processing in days where the amount of information is growing considerably. In this paper, we propose a statistical generative approach for multi-document summarization. In particular, we formulate the multi-document summa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004